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IN THE CLAIMS: 

Please cancel claims 14, 19 and 22, and amend the claims as follows: 

1 . (Currently Amended): A distributed data cache coupled to one of a plurality 
of reconfigurable execution nodes (RXN), wherein the hotorogonoous proc e ss i ng 
reconfiqurable execution nodes are coupled by a programmable interconnection 
network having a network root and a plurality of crosspoint switches, each of the RXNs 
being selectively configurable to perform a one of a plurality of processing operations. 
the distributed data cache comprising: 

a plurality of cache memory units each having a plurality of cache ports; 

a plurality of data buses connected with each of the cache memory units, 
wherein each of the plurality of data buses is connected with one of the plurality of 
cache ports of each of t he cache memory units;[[and]] 

the reconfigurable execution node adapted for actively processing data 
and having at least one data input and at least one data output, wherein the data input 
and data output are connected with the plurality of data buses[[.11 ; and 

wherein the RXN is adapted to be reconfigured to perform a sequence of 
operations to actively process data responsive to a sequence of the control words 
stored in the cache memory. 

2. (Cancelled) 

3. (Previously Presented): The distributed data cache of Claim 1 , further 
comprising a multiplexer for alternately connecting the data input with each of the 
plurality of data buses. 

4. (Previously Presented): The distributed data cache of Claim 1 , further 
comprising a multiplexer for alternately connecting the data ouput with each of the 
plurality of data buses. 
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5. (Previously Presented): The distributed data cache of Claim 1 , further 
comprising a plurality of data address generators connected with a memory unit and the 
plurality of data buses. 

6. (Previously Presented): The distributed data cache of Claim 5, wherein the 
plurality of data address generators are adapted to retrieve a plurality of data values 
from the memory unit and communicate the plurality of data values to the plurality of 
data buses. 

7. (Original): The distributed data cache of Claim 6, wherein the plurality of data 
address generators are adapted to simultaneously communicate the plurality of data 
values to the plurality of data buses, wherein each of the plurality of data value is 
communicated to a different one of the plurality of data buses. 

8. (Original): The distributed data cache of Claim 7, wherein the first cache 
memory unit Is adapted to simultaneously load a plurality of data values from the 
plurality of data buses, such that each of the plurality of data values is loaded in a 
different one of the plurality of cache lines of the first cache memory unit through the 
same port. 

9. (Original): The distributed data cache of Claim 1 , wherein the number of 
cache lines of the first cache memory unit are equal to the number of data buses. 

10. (Original): The distributed data cache of Claim 1 , further comprising at least 
one additional cache memory unit also having a plurality of cache lines, wherein each 
cache line of the additional cache memory unit is connected with the plurality of data 
buses. 

1 1 . (Original): The distributed data cache of Claim 1 0, wherein the total number of 
cache memory units is equal to the number of cache lines in each cache memory unit. 
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12. (Currently Amended): An apparatus for transferring a plurality of data values 
arranged in a matrix, the apparatus coupled to one of a plurality of heterogeneous 
processing nodes, wherein the heterogeneous processing nodes are coupled by a 
programmable interconnection network having a network root and a plurality of 
crosspoint switches, the apparatus comprising: 

a plurality of cache memory units, each cache memory unit having a 
plurality of cache ports; and 

a plurality of data buses, each data bus connected with a different one of 
the plurality of cache ports from each of the cache memory or register file units so that 
each of the data buses is connected to one port of every one of the data buses . 

1 3. (Currently Amended): The apparatus of Claim 1 2, further comprising a 
plurality of data address generators adapted to retrieve a plurality of sets data values 
from the memory unit and communicate the plurality of sets data values to the plurality 
of data buses, the apparatus being configured to transpose a matrix stored in the main 
memory by reading a matrix comprising a plurality of row elements from the main 
memory into the plurality of data buses, then simultaneously transferring the row 
elements to the cache ports of each o ne of the cache memories, and repeating the 
operation for each row of the stored matrix. 

14. (Cancelled) 

1 5. (Currently Amended): The apparatus of Claim [[141 1 12 . wherein the plurality 
of data address generators are adapted to sequentially communicate the plurality of 
sets of data values with the plurality of data busesfr.l V. and 

wherein each and everv of the cache memorv units is adapted to simultaneously 
load one of the sets of data values from the plurality of data buses, such that each data 
value of the set of data values is simultaneously loaded in a different one of the plurality 
of cache orts of the cache memorv unit or register file. 



16-19. (Cancelled) 
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20. (Previously Presented): A method for transferring a plurality of data values 
arranged in a matrix to transpose the matrix, the matrix including a plurality of 
heterogeneous processing nodes coupled by a programmable interconnection network 
having a network root and a plurality of crosspoint switches, the method comprising: 

retrieving a first subset of data values from the plurality of data values 
from a memory unit storing the matrix in rows; 

simultaneously transferring the first subset of data values comprising one 
of the rows of the matrix; to a plurality of data buses, wherein each data value of the first 
subset is transferred to a different one of the plurality of data buses; 

simultaneously loading the first subset of data values from the plurality of 
data buses to a first cache memory unit having a plurality of cache ports, wherein each 
cache port receives a data value from a different one of the plurality of data buses to 
define a column of the matrix; and 

repeating the operation for each row of the matrix to complete transposing 

of the matrix. 

21 . (Previously Presented): The distributed data cache of claim 1 , wherein the 
RXN includes a plurality of selectively configurable elements each having a 
configuration determined by the value of a multi-bit control word, the bits being 
simultaneously supplied In one or more of the control words stored in the cache memory 
units over the data buses. 

22. (Cancelled) 

23. (Previously Presented): The distributed data cache of claim 22, wherein each 
of the memory units comprises a plurality of cache lines, each cache line being 
associated with one of the plurality of cache ports, each cache line storing one of the 
control words, the sequence of control words being retrieved by an index uniquely 
identifying the cache line, a sequence of the indexes defining a single access control 
word. 
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24. (Previously Presented): The distributed data cacfie of claim 23, wherein the 
access control world is identified by a flag bit, the remainder of the access control word 
comprising a sequence of indexes. 
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